AITopics | multivariate gaussian

Amortized Bayesian inference (ABI) offers fast, scalable approximations to posterior densities by training neural surrogates on data simulated from the statistical model. However, ABI methods are highly sensitive to model misspecification: when observed data fall outside the training distribution (generative scope of the statistical models), neural surrogates can behave unpredictably. This makes it a challenge in a model comparison setting, where multiple statistical models are considered, of which at least some are misspecified. Recent work on self-consistency (SC) provides a promising remedy to this issue, accessible even for empirical data (without ground-truth labels). In this work, we investigate how SC can improve amortized model comparison conceptualized in four different ways. Across two synthetic and two real-world case studies, we find that approaches for model comparison that estimate marginal likelihoods through approximate parameter posteriors consistently outperform methods that directly approximate model evidence or posterior model probabilities. SC training improves robustness when the likelihood is available, even under severe model misspecification. The benefits of SC for methods without access of analytic likelihoods are more limited and inconsistent. Our results suggest practical guidance for reliable amortized Bayesian model comparison: prefer parameter posterior-based methods and augment them with SC training on empirical datasets to mitigate extrapolation bias under model misspecification.

likelihood, marginal likelihood, sc training, (13 more...)

arXiv.org Machine Learning

2512.14308

Country:

South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
North America > United States > New York > Rensselaer County > Troy (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
(2 more...)

Genre: Research Report > New Finding (1.00)

Industry: Transportation (0.49)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.35)

Add feedback

a935ba2236c6ba0fb620f23354e789ff-Supplemental-Conference.pdf

Neural Information Processing SystemsOct-9-2025, 04:09:12 GMT

artificial intelligence, covariance, machine learning, (18 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)

Add feedback

Export Reviews, Discussions, Author Feedback and Meta-Reviews

Neural Information Processing SystemsOct-2-2025, 21:12:31 GMT

First provide a summary of the paper, and then address the following criteria: Quality, clarity, originality and significance. Authors propose a method of estimating a graphical model for continuous data that blends the following three, established ideas: 1) assume the data follows a multivariate Gaussian and estimate using the graphical lasso; 2) do not assume the data follows a multivariate Gaussian and instead use a Gaussian copula, the nonparanormal, to allow arbitrary single variable marginals; or 3) assume a specific tree-structured factorization and model arbitrary bivariate marginals along the tree structure. The proposed method introduces the blossom tree, which is a specific factorization of the model into a collection of densely connected blossom components that are connected by a specific set of tree edges. In particular, each blossom is connected (via a pedicel node) to at most one tree edge. The blossom components are modeled as sparse multivariate Gaussians (or using the non-paranormal copula) and the tree edges are modeled as arbitrary bivariate distributions with single variable marginals that are consistent with the marginal of any blossom pedicel to which they are attached.

blossom tree, factorization, graphical model, (11 more...)

Neural Information Processing Systems

Country: North America > Canada > Quebec > Montreal (0.04)

Genre: Overview (0.35)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.73)

Add feedback

45f31d16b1058d586fc3be7207b58053-AuthorFeedback.pdf

Neural Information Processing SystemsOct-2-2025, 19:37:39 GMT

artificial intelligence, machine learning, newton algorithm, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.30)

Add feedback

08fa43588c2571ade19bc0fa5936e028-Supplemental.pdf

Neural Information Processing SystemsOct-1-2025, 23:42:39 GMT

artificial intelligence, generator, machine learning, (17 more...)

Neural Information Processing Systems

Industry: Health & Medicine (0.70)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.95)

Add feedback

submit

Amir

Neural Information Processing SystemsOct-1-2025, 22:03:04 GMT

artificial intelligence, counterfactual distribution, machine learning, (16 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Add feedback

A Proof of Proposition 1 We first follow the proof of the log-sum inequality to prove the following inequality: q u (y |D r) log q u (y |D

Neural Information Processing SystemsAug-16-2025, 01:23:05 GMT

Define the function f (t) null t log t which is convex. This section discusses the sparse GP model that is used in the classification of the synthetic moon dataset in Sec. A GP is fully specified by its prior mean (i.e., assumed to be Given the latent function values (i.e., also known as inducing variables) On the other hand, Figs. 9 and 10 visualize the approximate posterior beliefs Let us consider the experiment in Sec. Figure 1 shows results of averaged KL divergences (i.e., performance metric described in Sec. 4) achieved by EUBO, rKL, and However, the fourth row of Table 3 shows that both EUBO and rKL do not perform that well. EUBO may suffer from poor unlearning performance when λ is too small. One may wonder how our unlearning methods can handle multiple users' request arriving sequentially Figure 12: Graphs of averaged KL divergence vs.

approximate posterior belief, kl divergence, posterior belief, (14 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)

Add feedback

Filters

Collaborating Authors

multivariate gaussian

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

a935ba2236c6ba0fb620f23354e789ff-Supplemental-Conference.pdf

b8a6550662b363eb34145965d64d0cfb-Supplemental.pdf

6801fa3fd290229efc490ee0cf1c5687-Supplemental-Conference.pdf

Improving the Accuracy of Amortized Model Comparison with Self-Consistency

a935ba2236c6ba0fb620f23354e789ff-Supplemental-Conference.pdf

Export Reviews, Discussions, Author Feedback and Meta-Reviews

45f31d16b1058d586fc3be7207b58053-AuthorFeedback.pdf

08fa43588c2571ade19bc0fa5936e028-Supplemental.pdf

submit

A Proof of Proposition 1 We first follow the proof of the log-sum inequality to prove the following inequality: q u (y |D r) log q u (y |D